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Abstract 

Background: Tlie tliermopliilic anaerobe Clostridium tliermocellum is a candidate consolidated bioprocessing (CBP) 
biocatalyst for cellulosic ethanol production. It is capable of both cellulose solubilization and its fermentation to 
produce lignocellulosic ethanol. Intolerance to stresses routinely encountered during industrial fermentations may 
hinder the commercial development of this organism. A previous C tliermocellum ethanol stress study showed that 
the largest transcriptomic response was in genes and proteins related to nitrogen uptake and metabolism. 

Results: In this study, C tliermocellum was grown to mid-exponential phase and treated with furfural or heat to a 
final concentration of 3 g.L"^ or 68°C respectively to investigate general and specific physiological and regulatory 
stress responses. Samples were taken at 10, 30, 60 and 120 min post-shock, and from untreated control 
fermentations, for transcriptomic analyses and fermentation product determinations and compared to a published 
dataset from an ethanol stress study. Urea uptake genes were induced following furfural stress, but not to the same 
extent as ethanol stress and transcription from these genes was largely unaffected by heat stress. The largest 
transcriptomic response to furfural stress was genes for sulfate transporter subunits and enzymes in the sulfate 
assimilatory pathway, although these genes were also affected late in the heat and ethanol stress responses. 
Lactate production was higher in furfural treated culture, although the lactate dehydrogenase gene was not 
differentially expressed under this condition. Other redox related genes such as a copy of the rex gene, a 
bifunctional acetaldehyde-CoA/alcohol dehydrogenase and adjacent genes did show lower expression after furfural 
stress compared to the control, heat and ethanol fermentation profiles. Heat stress induced expression from 
chaperone related genes and overlap was observed with the responses to the other stresses. This study suggests 
the involvement of C tliermocellum genes with functions in oxidative stress protection, electron transfer, 
detoxification, sulfur and nitrogen acquisition, and DNA repair mechanisms in its stress responses and the use of 
different regulatory networks to coordinate and control adaptation. 

Conclusions: This study has identified C. tliermocellum gene regulatory motifs and aspects of physiology and gene 
regulation for further study. The nexus between future systems biology studies and recently developed genetic 
tools for C tliermocellum offers the potential for more rapid strain development and for broader insights into this 
organism's physiology and regulation. 
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Background 

Processing biomass for biochemical conversion of plant 
cell wall polysaccharides for fuel production by fermen- 
tation requires a pretreatment step that often involves 
an acid hydrolysis at high temperatures [1,2]. The disad- 
vantage of this treatment is the release of fermentation 
inhibitors such as the sugar degradation product furfural 
[3,4]. Depending on the fermentative microorganism, 
the release of these inhibitors can affect an organisms 
ability to produce ethanol and even to grow [3,4]. A fur- 
ther consideration in choosing an organism for biofuel 
production is tolerance to fermentation end products 
such as ethanol. Metabolic engineering to facilitate redir- 
ection of carbon flow towards increased ethanol titer 
does mean the organism used for this process needs to 
have a high tolerance to the desired solvent. 

The thermophilic anaerobe Clostridium thermocellum is 
of interest due to its ability to convert biomass cellulose to 
ethanol [5,6]. C. thermocellum produces a large extracellu- 
lar protein complex called the cellulosome which is highly 
active on plant cell wall polysaccharides [7-9]. The native 
production of enzymes required for cellulose deconstruc- 
tion is economically advantageous for industrial purposes, 
thus circumventing the costly addition of non-native hy- 
drolytic enzymes. Recent advances for C. thermocellum in- 
clude genome sequences for a number of strains [10-13], 
strategies and strains to delete genes [14-17], charac- 
terization of CRISPR elements [18], genetic and mechanis- 
tic insights into ethanol tolerance [10], as well as carbon 
recovery [19]. A number of systems biology studies have 
been conducted with C. thermocellum in the last several 
years, including; transcriptomic profiles for batch crystal- 
line cellulose fermentations [20], chemostat growth with 
either cellobiose or crystalline cellulose at different dilu- 
tion rates [21], proteomic analysis of cellulosomes [22,23], 
proteomic analysis of core metabolism [24], and an "omics" 
analysis of ethanol stress [25]. In addition, a genome-scale 
metabolic model has been developed [26] and several re- 
gulatory systems have been characterized [27-29] . 

In this study, we have compared the effects on the C. 
thermocellum transcriptome upon heat, or furftiral expos- 
ure and compared this to a recently published dataset of 
ethanol exposure. This was undertaken to understand the 
global effects of these stressors and to contrast a physical 
stress such as the well characterized microbial response to 
high temperature with the chemical stress imposed by 
ethanol or furfural treatment. Ultimately by revealing how 
this organism is affected by these stressors, we aim to add 
valuable knowledge to the way this organism could be de- 
veloped and utilized for applied industrial goals. 

Results 

The time point in mid-exponential growth when the test 
fermentations were exposed to stress was designated as 



"time zero". Samples were either referred to as "control" 
for untreated control fermentations or heat or furfural 
shock (or treatment) for those derived from fermenta- 
tions that had an increased temperature 68°C or 3 g.L'^ 
furfural treatment respectively at "time zero". 

C. thermocellum growth response 

The application of each shock treatment negatively 
influenced C. thermocellum growth (Figure 1). The cul- 
ture turbidity of ODeoonm units for the experimental fer- 
mentors prior to the administration of shock treatment 
was similar to the control fermentors and ranged from 
0.57-0.62 (Figure 1). Growth rates (1/time (h'^)) leading 
up to the administration of each stressor treatment were 
0.25, 0.30, 0.26 and 0.29 for the furftiral shock, heat 
shock, ethanol and control fermentor respectively. 
ODeoonm measurements taken periodically (up to 150 
min) post shock demonstrated the negative effect each 
shock had on growth with averaged growth rates (1/time 
(h'^)) across the monitored 120 min stress period declin- 
ing to 0.15, 0.05 and 0.07 for furfural, heat and ethanol 
treatment respectively relative to the relatively stable 
growth rate of 0.31 for the control fermentations across 
the same period. Growth rates for the ethanol shock ex- 
periment were calculated from data reported previously 
[25]. Heat shock caused a decrease in culture turbidity, 
although this occurred 30 min after the temperature of 
the fermentor reached 68°C. Furfural and ethanol both 
caused a plateau in growth immediately following ad- 
dition of each stress, and growth began to resume in 
both conditions towards the end of the monitored 
period. Analysis of fermentation products illustrates that 
less than 50% of carbon consumed as cellobiose across 
the 120 min sampling period in the control fermenta- 
tions could be recovered as acetate, ethanol or lactate 
(or assumed CO2) (Table 1). Previous studies have 
shown that C. thermocellum carbon recoveries can be 
variable and carbon balances can be made of products 
that were not measured as part of this study [19,24]. A 
larger proportion of carbon would likely have been re- 
covered if the fermentations were allowed to transition 
to stationary phase with end point sampling once the 
substrate was exhausted. The batch fermentors used 
in this study were open systems so any carbon lost as 
CO2, or volatile products were not measured although 
we assume in our calculations that a mole of CO2 
was produced per mole of acetic acid and ethanol 
(Table 1). Carbon recovery for furfural increased to 
almost 100% across the 120 min post stress period 
for furfural, due to a reduction in cellobiose con- 
sumption as cell growth plateaued and was coupled 
with a dramatic rise in lactate production relative to 
the control (0.3 g.L'^ produced in the furfural stress 
fermentor compared to 0.008 g.L"^). 
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No ethanol was produced during the stress period 
after furfural addition and acetate production dropped 
relative to the control After heat stress, the cellobiose 
consumption did not decrease substantially relative to 
the control, however, the production of acetate and 
ethanol decreased relative to the control with some 
additional lactate produced, thus suggesting an alter- 
native outlet for carbon occurred in C thermocellum 
after this particular stress. Furfural was applied at mid- 
exponential phase to a final concentration of 3 g.L'^ and 
detected after its application. At the first sampling time 
point, 10 min post addition, 2.87 g.L'^ of furfural was 
measured (S.D. 0.04 g.L'^) and at the final timepoint 
2.24 g.L"^ (S.D. 0.06 g.L'^) of furfural was detected. This 



decrease could potentially be attributed to metabolic 
conversion of furfural to furfuryl alcohol as demons- 
trated in Cupriavidus basilensis [30] and Escherichia coli 
[31], and evaporation may be another factor that was 
not accounted for in this study. 

Transcriptomic profile of C. thermocellum in response to 
stress exposure 

Gene expression profiles for the control, furfural or 
heat-treated fermentations were generated from samples 
harvested at time 10, 30, 60 and 120 min post-shock 
using NimbleGen microarrays including an existing 
ethanol stress dataset [25]. Comparison to gene expres- 
sion profiles derived from untreated control fermentations 



Table 1 C. thermocellum cellobiose consumption and fermentation products across each 120 min treatment period 

Growth condition Cellobiose Acetate Ethanol Lactate % Carbon 

consumed* g.L'^ produced* g.L'^ produced* g.L'^ produced* g.L'^ recoveryt 

Across the stress period (0-120 min post stress application) 

Heat Stress Fermentation 1 .26 (±0.39) 0.1 8 (±0.04) 0.06 (±0.1 8) 0.01 4 (±0.006) 30.7 

Furfural Stress Fermentation 0.48 (±0.12) 0.18 (±0.01) -0.04 (±0.004) 0.30 (±0.010) 97.7 

Control Fermentation 1.35 (±0.62) 0.29 (±0.13) 0.12 (±0.04) 0.008 (±0.005) 47.0 

Across the whole experiment (post inoculation to 120 min post stress application) 

Control Fermentation 3.02 (±0.15) 0.47 (±0.12) 0.20 (±0.01) 0.02 (±0.003) 35.02 
^Values are the mean of duplicate fermentations and the standard error of the mean given in parentheses. 

tPercentage carbon recovery was calculated using the following equation ffci+ci = ^^etl^-t])^ described in [19,49] where /?C2+C3 carbon recovered from 
the C2 and C3 fermentation products. [A], [E], and [L] are the molar concentrations of acetate [A], ethanol [E], and lactate [L]. The factor of 3 in the numerator 

accounts for the expected formation of 1 mole of CO2 and/or formate produced per mole of ethanol and acetate as previously described [19]. [So] is the initial 
substrate concentration and [5/] is the final substrate concentration. 
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permitted the identification of stress induced transcrip- 
tional changes. When filtered by a significance threshold 
ofp < 0.05, 3056/3067 genes were considered significantly 
different in at least a single time point for a given stress 
condition. Using a cut off of 2-fold difference in gene ex- 
pression between the treated and untreated control, 779, 
731 and 683 genes were affected by ethanol, furfural or 
heat treatment respectively during at least one time point 
sampled, with 1,474 genes significantly (p < 0.05) diffe- 
rentially expressed (± 2 fold relative to the control fermen- 
tations) in at least one of the stress studies (Additional 
file 1: Table SI). The microarray dataset has been de- 
posited in NCBI GEO under the accession number 
GSE40402. 

A subset of genes that were differentially expressed 
120 min post heat shock and furfural shock relative to 
the control profiles were selected for real-time quantita- 
tive PGR (RT-qPGR) confirmation. The ethanol shock 
microarray was previously validated [25]. Gorrelation co- 
efficient values of = 0.96 and 0.93 were obtained for 
comparisons between the microarray and RT-qPGR ex- 
pression profiles for heat and furfural shock respectively 
(Additional file 2), indicating the microarray data were 
of good quality. 

A comparison of the affected genes under each of the 
stress conditions is summarized in Figure 2 with genes 



categorized by Glusters of Orthologous Groups (GOG) 
[32]. One hundred and forty three genes were differen- 
tially expressed under all conditions tested. The majority 
of these shared genes (71), were assigned to the General 
Function Prediction or Hypothetical Protein Prediction 
categories. The largest category with a functional predic- 
tion was Gellular Processes with twenty-three genes 
assigned to this group. Within Gellular Processes nine 
genes had predicted functions in Inorganic Iron Trans- 
port and Metabolism and five were involved in Gell 
Envelope Biogenesis. Additional file 1: Table S2 lists the 
143 genes differentially expressed in all the stress condi- 
tions. Gonsistent up-regulation occurred in the eleven 
genes between Gthe_2532 and Gthe_2542 that include 
genes encoding components of a putative sulfate trans- 
porter (Gthe_ 2532, 2533, 2534) and genes related to the 
sulfur assimilatory pathway. Gonversion of sulfite to the 
biologically relevant sulfide may be catalyzed by the en- 
zyme sulfite reductase (Gthe_2541) in a reaction depen- 
dent on three molecules of NAD PH. Sulfite reductase 
requires siroheme as a cofactor [33] and other genes 
within this genomic region encode proteins involved in 
vitamin B12 metabolism, porphyrin and siroheme bio- 
synthesis. Related to vitamin B12 metabolism, were the 
genes Gthe_2784-2789 encoding methyltransf erases, 
ferredoxin and cobalamin B12 binding proteins up- 



Heat shock (68X) 



Hypothetical 

General function prediction only 
Cellular Processes 
Amino acid transport and nnetabolism 
Carbohydrate transport and nnetabolism 
Chromatin structure and dynamics, DNA 
replication, recombination and repair 
Energy production and conversion 
Transcription, RNA processing and modification 
Translation, ribosomal structure and biogenesis 
Nucleotide transport and metabolism 
Signal transduction mechanisms 
Coenzyme metabolism 

Lipid metabolism 



All stress conditions 



General function prediction only 

Hypothetical 

Cellular Processes 

Energy production and conversion 

Coenzyme metabolism 

Transcription, RNA processing and modification 

Translation, ribosomal structure and biogenesis 

Amino acid transport and metabolism 

Chromatin structure and dynamics, DNA replication, 

recombination and repair 

Carbohydrate transport and metabolism 

Nucleotide transport and metabolism 

Signal transduction mechanisms 

Lipid metabolism 




Furfural shock (3 g.L-^) 



General function prediction only 

Hypothetical 

Cellular Processes 

Amino acid transport and metabolism 

Chromatin structure and dynamics, DNA replication, 

recombination and repair 

Carbohydrate transport and metabolism 

Translation, ribosomal structure and biogenesis 

Signal transduction mechanisms 

Transcription, RNA processing and modification 

Nucleotide transport and metabolism 

Coenzyme metabolism 

Energy production and conversion 

Lipid metabolism 

Membrane Transport 



Ethanol shock (3.9 g.L-^) 



Hypothetical 

General function prediction only 
Cellular Processes 

Translation, ribosomal structure and biogenesis 
Amino acid transport and metabolism 
Transcription, RNA processing and modification 
Energy production and conversion 
Signal transduction mechanisms 

Chromatin structure and dynamics, DNA replication, recombination 
and repair 

Carbohydrate transport and metabolism 
Nucleotide transport and metabolism 
Coenzyme metabolism 

Lipid metabolism 



Figure 2 Comparison of transcriptome response of C. thermocellum 27405 to ethanol, furfural and heat shock relative to untreated 
control fermentations. Genes were categorized into functional groups based on COG assignment and listed for those genes exclusively affected 
in each treatment or affected in all stresses. A through G categories for each Venn segment is used to cross reference to Additional file 1 : Table SI as 
listed in the column titled Venn diagram code. 
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regulated after all treatments, which suggests an in- 
creased need for vitamin B12 and corrinoid containing 
proteins upon exposure to each of these stresses. Other 
genes affected by the three stress treatments with pos- 
sible redox balance functions included two NADH dehy- 
drogenases (Cthe_0429, Cthe_3023) and an annotated 
CO dehydrogenase (Cthe_2801). The CO dehydrogenase 
formed a putative operon with an ABC transporter of 
unknown function (Cthe_2802-2804) and was affected 
by all stress treatments. Genes encoding the molecular 
chaperones DnaK (Cthe_1322) and GrpE (Cthe_1323) 
were up-regulated after all stressor treatments. 

A subset of genes that were most differentially ex- 
pressed (ten up regulated and ten down regulated genes) 
for each of the stress conditions is given in Table 2. 
Genes were ranked based on differential gene expression 
for each of the time points and the ten highest and ten 
lowest for each time point were given a score, lowest be- 
ing the genes most down- regulated and highest being 
genes most up-regulated. The first 10 min time period 
for heat stress was excluded in this analysis due to few 
genes passing the significance threshold of p < 0.05. This 
is most likely due to a delay in the cells response to heat 
stress as seen from the growth curve where the fer- 
mentation exhibits an effect from 30 min post stress 
application. Gene expression patterns after heat shock 
include the up-regulation of central carbon metabolic 
genes such as Cthe_3116 encoding a mannose-6- 
phosphate isomerase and Cthe_2449 encoding one of 
the six C. thermocellum ATCC 27405 phosphoglycer- 
ate mutase enzymes. Phosphoglycerate mutase was 
shown to be highly expressed as part of the E, coli heat 
shock response and was suggested to reflect the in- 
creased energy demand for the synthesis of heat shock 
proteins [34]. Cthe_3125 encoding the heat shock protein 
Hsp20 was up-regulated after furfural shock and was one 
of the most highly up-regulated genes after heat shock, 
with the highest level of expression occurring 30 min 
after the temperature stabilized at 68°C. Other heat spe- 
cific responses were a down-regulation of the gene 
Cthe_0665 and Cthe_0666 encoding the membrane 
bound HflK and HflC proteins. Homologs of the latter 
two proteins have a role in sensing heat stress in E, coli 
and triggering a heat shock response via the regulation 
of the FtsH protease [35]. A gene (Cthe_2276) encoding 
an FtsH protein was down- regulated in heat stress thus 
suggesting at least one signaling cascade for sensing 
environmental stress in C. thermocellum. 

A region of genes related to redox balance 
(Cthe_0422-0432) were the most down-regulated genes 
in response to furfural exposure including the redox re- 
sponse regulator Rex (Cthe_0422), an NADH hydrogen- 
ase (Cthe_0427), and a Fe-hydrogenase (Cthe_0430). 
Within this genomic region is the alcohol dehydrogenase 



gene (Cthe_0423), which has been identified as import- 
ant in C. thermocellum ethanol tolerance as a shift in co- 
factor specificity occurs from NADH to NADPH [10] 
indicating cellular redox status is associated with the 
functioning of this enzyme. Two of the most down- 
regulated genes after furfural shock were Cthe_0943 and 
Cthe_0944, which were part of a larger region of 
Cthe_0940-0953 specifically down-regulated after fur- 
fural treatment. 

Ethanol stress effects on gene regulation in C. 
thermocellum 27405 have been described [25]. Briefly, 
genes related to nitrogen metabolism were up-regulated 
including the subunits of the urease enzymes and related 
accessory proteins in the genomic region of Cthe_1816- 
1823, glutamine synthetase (Cthe_1539), and glutamate 
synthase (Cthe_0198). While, ribosomal proteins and 
acetate kinase were down-regulated in response to etha- 
nol as previously noted [25] . 

A hierarchical clustering of the 1,474 genes that were 
significantly differentially expressed by at least 2-fold be- 
tween a stress condition and the control in at least a sin- 
gle time point was divided into 10 clusters (Figure 3, 
Additional file 1: Table SI). Clusters 1 through 4 grouped 
those genes that were in general up-regulated after expos- 
ure to stress while Cluster 8 included the majority of 
genes consistently down-regulated after all the treatments. 
Cluster 4 consisted of sixteen genes up-regulated consist- 
ently after furfural exposure and in the latter time points 
of heat and ethanol exposure including those related to 
sulfur transport and assimilation and Cthe_280 1-2804 en- 
coding a carbon monoxide dehydrogenase subunit and 
transporter as mentioned above. Cluster 10 featured fifty- 
nine genes including the genes Cthe_0422-0432 men- 
tioned above, that were subjected to a rapid and sustained 
down- regulation after exposure to furfural. Heat shock 
tended to result in gene down-regulation while ethanol 
exposure had little effect on the Cluster 10 genes (Figure 3, 
Additional file 1: Table SI). 

Cellulosome genes 

Fifty-six genes associated with the cellulosome were dif- 
ferentially regulated in at least one of the three stress 
conditions (Additional file 1: Table SI). Seven cellu- 
losome genes were consistently up-regulated in at least 
one time point after stress including the glycoside 
hydrolases (Cthe_0745 Cthe_2590). These were recently 
shown to be affected by growth rate where higher ex- 
pression from these loci occurred at lower growth rates 
[21]. These expression patterns would therefore reflect 
the decreased growth rates in the batch fermentations 
once stress had been applied. It is plausible that other 
cellulosome related genes with consistent increased ex- 
pression across the stress treatments were also a growth 
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Table 2 Subset of genes with the highest (n = 10) and lowest (n = 10) differential expression to each of the treatments 
compared to control fermentations 

Locus tag Functional annotation Ratio: control - stress^ Venn code 

10 min 30 min 60 min 120 min 

Furfural stress 



Cthe_ 


_0211 


Glycoside hydrolase family 16 


-2.22 


-3.56 


-2.41 


-2.87 


D 


Cthe_ 


J 643 


Phage-associated protein 


-2.12 


-4.49 


-2.41 


-2.77 


F 


Cthe_ 


J 644 


Hypothetical protein 


-2.26 


-4.07 


-2.27 


-2.79 


F 


Cthe_ 


_2531 


Sulfate ABC transporter, periplasmic sulfate-binding protein 


-1.87 


-3.19 


-3.58 


-3.99 


F 


Cthe_ 


_2532 


Sulfate ABC transporter, inner membrane subunit CysT 


-1.90 


-2.92 


-3.07 


-3.55 


D 


Cthe_ 


_2533 


Sulfate ABC transporter, inner membrane subunit CysW 


-1.77 


-2.82 


-3.29 


-3.46 


D 


Cthe_ 


_2535 


Adenylylsulfate reductase, thioredoxin dependent 


-1.54 


-2.86 


-3.27 


-3.45 


D 


Cthe_ 


_2962 


Oligopeptide/dipeptide ABC transporter, ATPase subunit 


-1.87 


-3.04 


-2.15 


-2.83 


F 


Cthe_ 


_2963 


Oligopeptide/dipeptide ABC transporter, ATPase subunit 


-1.96 


-3.33 


-2.15 


-2.59 


F 


Cthe_ 


_3125 


Heat shock protein Hsp20 


-2.60 


-3.21 


-2.49 


-2.30 


B 


Cthe_ 


_0422 


Redox-sensing transcriptional repressor rex 


2.98 


3.47 


2.81 


3.48 


B 


Cthe_ 


_0424 


Aminoglycoside phosphotransferase 


1.98 


3.14 


2.23 


2.93 


B 


Cthe_ 


_0425 


Hypothetical protein 


2.49 


4.07 


3.23 


3.84 


D 


Cthe_ 


_0426 


Fe-S cluster domain protein 


2.47 


3.86 


2.82 


3.60 


B 


Cthe_ 


_0427 


Stage II sporulation protein E 


2.31 


4.20 


2.46 


3.42 


D 


Cthe_ 


_0428 


NADH dehydrogenase (ubiquinone) 24 kDa subunit 


3.08 


5.49 


3.05 


4.61 


F 


Cthe_ 


_0430 


Hydrogenase, Fe-only 


1.79 


4.83 


2.90 


4.06 


F 


Cthe_ 


_0431 


Hypothetical protein 


2.24 


5.64 


3.15 


4.74 


D 


Cthe_ 


_0943 


Hypothetical protein 


2.53 


3.30 


2.63 


2.86 


C 


Cthe_ 


_0944 


SMC domain protein 

Heat stress 


2.20 


2.88 


2.32 


2.63 


C 


Cthe_ 


J 309 


Radical SAM domain protein 


0.14 


-1.94 


-2.37 


-2.14 


A 


Cthe_ 


J 743 


Protein of unknown function DUF955 


-1.89 


-2.77 


-2.58 


-2.72 


D 


Cthe_ 


J 746 


Hypothetical protein 


-0.33 


-1.99 


-2.98 


-1.82 


D 


Cthe_ 


J 851 


Protein of unknown function DUFl 1 13 


-0.30 


-2.84 


-2.29 


-1.76 


D 


Cthe_ 


_2448 


ABC-type transporter, integral membrane subunit 


-0.40 


-2.82 


-2.19 


-1.66 


A 


Cthe_ 


_2449 


Phosphoglycerate mutase 


-0.03 


-2.44 


-2.04 


-1.84 


A 


Cthe_ 


_3054 


Hypothetical protein 


0.01 


-2.75 


-2.03 


-2.33 


B 


Cthe_ 


_3112 


Glycosidase related protein 


-0.71 


-2.61 


-2.52 


-1.71 


B 


Cthe_ 


_3116 


Mannose-6-phosphate isomerase, class 1 


-0.13 


-2.78 


-2.84 


-2.11 


B 


Cthe_ 


_3125 


Heat shock protein Hsp20 


-0.06 


-4.07 


-2.80 


-2.74 


B 


Cthe_ 


_0323 


Hypothetical protein 


-0.44 


1.72 


2.06 


1.49 


B 


Cthe_ 


_0539 


ABC transporter related 


-0.23 


2.10 


2.18 


2.23 


B 


Cthe_ 


_0665 


HflK protein 


0.13 


2.47 


2.95 


2.03 


A 


Cthe_ 


_0938 


Regulatory protein DeoR 


-0.20 


2.22 


2.53 


1.96 


D 


Cthe_ 


J 390 


Alpha/beta hydrolase fold 


1.26 


3.66 


2.91 


2.75 


D 


Cthe_ 


J 921 


Transcriptional regulator PadR family protein 


1.03 


1.77 


2.94 


2.65 


A 


Cthe_ 


J 922 


Hypothetical protein 


0.56 


1.41 


2.64 


2.43 


A 
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Table 2 Subset of genes with the highest (n = 10) and lowest (n = 10) differential expression to each of the treatments 
compared to control fermentations (Continued) 



Cthe_2266 


Vacuolar H+transporting two-sector ATPase F subunit 


-0.03 


1.49 


2.58 


1.87 


E 


Cthe_2956 


Hypothetical protein 


0.54 


1.86 


1.96 


1.61 


B 


Cthe_2957 


Hypothetical protein 


0.53 


2.01 


1.56 


1.57 


B 



^Negative values indicate greater gene expression in the treatment samples while positive values indicate lower gene expression in the treatment samples 
compared to the control fermentations. Boldface indicates the differential gene expression was statistically significant (p < 0.05). 



rate effect rather than a reaction to the application of 
each of the stressors. 

Transcriptional regulators 

Sixty-one transcriptional regulators were differentially ex- 
pressed in at least one stress condition (Additional file 1: 
Table S3). Four regulators, Cthe_1619, Cthe_1745, 
Cthe_2808 and Cthe_2969 were consistently up-regulated 
in the three stress conditions relative to the control and 
are located next to genes also up-regulated. Cthe_1745 
was up-regulated in all conditions and located in the re- 
gion Cthe_1743-1750. This region was up-regulated after 
all treatments relative to the control fermentations and 
includes genes coding DNA mismatch repair enzymes, 



restriction endonucleases and genes functionally anno- 
tated as hypothetical Cthe_2808, a Lad repressor was 
up-regulated after all stresses and its repression in C. 
thermocellum has been shown to be alleviated by 
laminaribiose [27,36]. The genes (Cthe_2807 and 
Cthe_2809) that the Lad regulator is known to repress 
were up-regulated after each of the stress treatments, al- 
though these genes were just below the 2 -fold threshold 
for differential expression after furfural treatment. This 
suggests that the expression of these genes can be 
influenced by other environmental stimuli besides the 
presence of laminaribiose. 

Other regulators of interest included Cthe_2524 which 
is located upstream of the sulfate transporter mentioned 



Cluster 1 

Cluster 2 

Cluster 3 
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Cluster 5 

Cluster 6 

Cluster 7 
Cluster 8 

Cluster 9 
Cluster 10 
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60 120 
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Time (min) 
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Figure 3 Hierarchical clustering of the 1,474 genes significantly (p > 0.05) differentially expressed in the treatments relative to the 
control C. thermocellum fermentations over the four sampling time points analyzed in this analysis. Genes were grouped into ten clusters 
based on responses to ethanol, furfural and heat treatments. Red indicates higher expression relative to the control, green represents down 
regulation of gene expression relative to the control. 
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above and was down- regulated after all stress treat- 
ments. The putative phosphate uptake regulator PhoU, 
encoded by Cthe_1601, was up-regulated under ethanol 
stress, down-regulated after heat stress and unaffected 
by furfural treatment and is located beside the phosphate 
transport components. The housekeeping sigma factor 
sigma 70 (Cthe_2521), required for transcriptional initi- 
ation, was down-regulated under all stress conditions. 
The gene Cthe_1792, encoding a CtsR regulator or Fir- 
micute transcriptional repressor of Class III stress genes 
was up-regulated 120 min post heat treatment. 

Regulatory motif analysis 

A motif identification strategy was undertaken to iden- 
tify cis regulatory motifs in the promoter regions of 
genes identified as differentially expressed in at least one 
of the stress analyses. In total, 120 cis regulatory motifs 
were identified by a phylogenetic comparison framework 
using 39 Clostridium strains and 17,328 different C. 
thermocellum ATCC27405 genomic regions of interest 
were identified (Additional file 1: Table S4). The 120 
motifs are further clustered into fifty-four groups regard- 
ing their patterns' similarity (Motif cluster ID column. 
Additional file 1: Table S4) and were used for a co- 
expression analysis to identify those genes in the stress 
studies that potentially formed an operon and were down- 
stream from a particular motif. Six motifs (Additional 
file 3: Table S5) were located in the promoter regions 
of genes that were co-regulated and investigated fur- 
ther to identify potential regulators that had respon- 
ded to heat, furfural or ethanol stress. We highlight 
several motifs below and provide the others as sup- 
plemental material. 

Genes with the mapped Motif 29 in general responded 
to stress treatments similarly, while genes with Motif 51 
tended to respond to ethanol stress more than heat or 
furfural. Motif 29 (Additional file 3: Table S5) appeared 
upstream of 14 genes, five of which encoded a putative 
sulfate transporter (Cthe_253 1-2535) that appeared to be 
co-expressed and potentially form an operon. Motif 29 
was most similar to a sequence recognized by IscR (Iron 
sulfur cluster Regulator) from Rickettsials. A BLASTP 
search using the IscR amino acid sequence from Clostrid- 
ium acetobutylicum ATCC 824 as the query sequence 
against C. thermocellum 27405 found one of the top hits 
(ID 70/208, 99% sequence coverage) in C. thermocellum 
as Cthe_2524, currently annotated as a BadM/Rrf2 fam- 
ily transcriptional regulator and located upstream of the 
sulfate transporter Cthe_2531-2535. Cthe_2524 was down- 
regulated after all stress treatments and may have a nega- 
tive regulatory role in the transcription of the downstream 
sulfate transport genes. Interestingly, Motif 29 occurs up- 
stream of a gene encoding Hsp20 (Cthe_3125) and could 
indicate the sulfate transport system up-regulation is part 



of a general stress response coordinated with the classical 
heat shock response. Motif 51 (Additional file 3: Table S5) 
was identified upstream of 16 genes, encompassing three 
regions of putative co-expression. Two cellulosome related 
dockerins (Cthe_0729 and Cthe_0246) and a glycosyl 
transferase were both induced by ethanol treatment and 
had Motif 51 upstream of the transcriptional start site. The 
consensus regulatory motif sequence is most likely recog- 
nized by a member of the AraC family of regulators with 
one AraC regulator (Cthe_2634) down-regulated within 
10 min after ethanol stress. 

Discussion 

This study sought to investigate the effect of furftiral 
and heat stress on the physiology of the candidate CBP 
organism C. thermocellum and compare this to a previ- 
ously published dataset based on ethanol stress [25] . 

Under the conditions used in this study, both heat and 
furfural stress caused a reduction in the fermentation 
growth rate compared to the untreated control and a de- 
crease in the main fermentation products ethanol and 
acetate occurred as a result (Table 1). Acetyl- Co A is a 
branch point in C thermocellum metabolism where car- 
bon flow is directed towards either of two main fermen- 
tation products, acetate or ethanol [6]. The decreased 
growth of the cells following stress treatment likely re- 
flects the impact to cellular physiology via a reduction in 
available cellular ATP as the reactions catalyzed by 
phosphotransacetylase and acetate kinase generates ATP 
and acetate [6]. Phosphotransacetylase (PTA; Cthe_1029) 
and acetate kinase (Cthe_1028) were down-regulated after 
all treatments, although PTA was not significantly down- 
regulated after heat shock (Additional file 1: Table SI). 
Acetyl-CoA can be converted to acetylaldehyde by the ac- 
tion of aldehyde dehydrogenase encoded by Cthe_2238 
[20,24] or the activity of a bifunctional alcohol/aldehyde 
dehydrogenase (Cthe_0423) [10]. The former gene was not 
differentially regulated after heat shock or ftirftiral shock; 
however Cthe_0423, encoding the enzyme capable of cata- 
lyzing both acetyl-CoA to acetylaldehyde and acetylalde- 
hyde to ethanol was dramatically down-regulated after 
ftirftiral shock from 10-120 min post shock by log2 ratios 
ranging from 1.91 to 2.60 (in Additional file 1: Table SI). 
By 120 min post heat shock, Cthe_0423 was down-regu- 
lated by a log2 ratio of 2.57 (Additional file 1: Table SI). 
These changes of expression are reflected in the amount 
of ethanol detected across the stress period (Table 1). No 
net ethanol was produced after furfural treatment and an 
apparent reduction occurred after heat stress. The detect- 
able levels for ethanol after heat treatment did vary and 
we cannot rule out the effect of increased evaporation 
on ethanol concentrations as the temperature of the fer- 
mentor was raised. Lactate, typically a minor fermenta- 
tion product was dramatically increased after furfural 
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treatment (Table 1) despite no differences identified in 
the expression of the gene encoding lactate dehydrogen- 
ase (Cthe_1053) under this condition (Additional file 1: 
Table SI). The pathway to lactate has no net loss of car- 
bon to CO2, hence the increased proportion of carbon 
recovery compared to the control fermentations, and it 
also generates an NAD which may help rebalance electron 
flow after furftiral stress. The application of fermentation 
end-products can induce shifts in C. thermocellum strain 
ATCC 27405 metabolism and corresponding enzyme 
levels do not necessarily compare with differences in end- 
product yields [37], as was also the case for some key 
genes in this study. 

Patterns of differential expression in the shock relative 
to the control indicated that the response to furfural is 
rapid and is maintained for a prolonged period of time. 
Previous work in Saccharomyces cerevisiae has shown 
that furfural tolerance requires a functional pentose 
phosphate pathway, and causes an accumulation of re- 
active oxygen species [38,39]. The redox status of the 
cell after furfural exposure is likely to be considerably 
disturbed and some of the most down-regulated genes 
after exposure to furftiral were those in the genomic re- 
gion around the gene (Cthe_0423) encoding the NADH 
dependent AdhE [10] and the Rex transcriptional repres- 
sor (Cthe_0422). Growth inhibition caused by furfural 
exposure has been shown to result from NAD(P)H de- 
pletion as the reduction of furfural to the less toxic 
furfuryl alcohol is catalyzed by NAD(P)H dependent oxi- 
doreductases on NAD(P)H, thus competing with NAD 
(P)H dependent alcohol dehydrogenases and impeding 
alcohol production [40,41]. This is a likely explanation 
for the increased lactate production and the complete 
halt in ethanol production when C. thermocellum was 
treated with furftiral. Furfural toxicity effects have also 
been demonstrated to impact in the assimilation of sul- 
fur, as sulfite reductase is reliant on NADPH, which 
leads on to effects in the biosynthesis of cysteine and 
methionine [40]. The C. thermocellum Cthe_2801 gene 
encodes a putative carbon monoxide dehydrogenase and 
this enzyme may play a role in dealing with redox imbal- 
ance associated with the oxidative stress from furfural 
exposure, or the more extreme stress imposed by the 
conditions used in this study. One report has shown that 
CO inhibits C. thermocellum hydrogenases and can in- 
crease the ratio of ethanol to acetate production [42]. How- 
ever, no evidence exists as to whether C. thermocellum can 
utilize the substrates of this enzyme, CO or the reverse re- 
action, CO2 as a carbon source. 

The effect of raising the temperature of the fermentor 
from 58°C to 68°C resulted in a drop in culture turbidity 
30 min after the temperature was stabilized at the higher 
temperature (Figure 1). Heat shock proteins are con- 
served and in B, subtilis the heat shock response genes 



are split into six classes, four of which I, III, IV and V 
were affected by the stress treatments (See review [43]). 
An induction of the C. thermocellum genes homologous 
to the B, subtilis Class I heat shock response genes oc- 
curred after all stress treatments. These included the genes 
encoding DnaK (Cthe_1322), the GroES domain protein 
(Cthe_2445), and genes in the dnaK operon (Cthe_1321- 
Cthe_1324). The dnaK operon in C. thermocellum has a 
similar architecture to that in B, subtilis [43] and includes 
the genes encoding HrcA repressor, GrpE, DnaK, DnaJ. 
Class III heat stress response genes are regulated by the 
CtsR regulator. The CtsR regulator in C. thermocellum 
27405 is most likely encoded by the gene Cthe_1792, 
currently annotated as the Firmicute transcriptional 
repressor of class III stress genes (Additional file 1: 
Table SI). This gene was up regulated 120 min after 
heat stress (Additional file 1: Table SI). Interestingly, 
knockout of the ctsR gene in Lactobacillus sakei 
bypassed the lag phase during fermentation [44] and 
could provide a direction for metabolic engineering of 
the C. thermocellum strain. The Class IV heat shock 
regulon has a single member, HtpG/Hsp90. The gene 
Cthe_0550 is annotated as encoding Hsp90 and was 
up regulated after all stress treatment (Additional file 1: 
Table SI). Overexpression of htpG was recently shown 
to enhance butanol tolerance and eventual strain adap- 
tation to butanol in C. acetobutylicum [45] and could 
provide a route for enhanced solvent resistance in C. 
thermocellum. 

Conclusions 

In summary, transcriptome analyses revealed a global 
view of the responses of the C. thermocellum to the 
challenge of two stressors, furfural and heat, for the 
first time. Many genes were considered differentially 
expressed in response to two or more of the stress expo- 
sures, and 325, 257, and 321 genes were considered re- 
sponsive to heat, furftiral, or ethanol stress specifically 
using the stringency criteria applied in this study. This 
study suggests the involvement of C. thermocellum genes 
with functions in oxidative stress protection, electron 
transfer, detoxification, sulfur and nitrogen acquisition, 
and DNA repair mechanisms in the stress responses and 
the use of different regulatory networks to coordinate 
and control adaptation. This study has identified C. 
thermocellum gene regulatory motifs and aspects of 
physiology and gene regulation for further study. 

Methods 

Controlled batch fermentations 

C. thermocellum ATCC27405 was cultured in MTC 
medium with 5 g.L'^ final concentration of cellobiose as 
the carbon source at 58°C as described previously [46,47]. 
For fermentor inoculation C. thermocellum 27405 was 
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grown overnight in 50 mL MTC, which was used to inocu- 
late a seed fermentor as described previously [25]. The seed 
fermentor was used to inoculate the experimental batch fer- 
mentors to a starting ODeoonm of approximately 0.1. Batch 
fermentations were performed in 7.5-L BioFlollO bioreac- 
tors (New Brunswick Scientific, NJ) fitted with agitation, 
pH and temperature probes and controls with a total 
volume of approximately 4.0 L of MTC medium [48]. 
Fermentors were sparged with filter-sterilized N2 gas to 
maintain anaerobic conditions. During fermentations, 
the agitation rate of the vessel was maintained at 300 
rpm and pH was maintained at pH 7.0 by automatic ti- 
tration with 3 N NaOH as described previously [25]. 

Fermentations were conducted for the wild-type C. 
thermocellum controls (no added stress), a 3.9 g.L'^ etha- 
nol shock treatment, a 3 g.L'^ furfural shock or a 68°C 
heat shock in duplicate. The heat shock increase in 
temperature from 58°C occurred over a five minute period 
and the first time point sample was taken once the culture 
temperature had reached 68°C. Growth was monitored by 
measuring ODeoonm with a model 8453 spectrophotom- 
eter (Hewlett-Packard, CA). Samples were harvested at ap- 
proximately mid-exponential phase (ODeoonm ~ 0-5) and 
at different time points post treatment. 

Fermentation product analyses with high-performance 
liquid chromatography (HPLC) 

HPLC analysis was used to measure the extracellular 
metabolite concentration of cellobiose, acetate, lactate, 
ethanol and furfural in 0.2 (im-filtered samples taken at 
different time points during fermentation, as described 
previously [48]. The fermentation samples were acidified 
with 2 M sulfuric acid, separated and quantified by 
HPLC using a LaChrom Elite System (Hitachi High 
Technologies America, Inc., CA). Analysis was performed 
with the oven (Model L-2350) set at 60°C, and a pump 
(Model L-2130) set with a flow rate of 0.5 mL.min'^ in 
5 mM H2SO4 as described previously [48]. The run time 
for each sample was set for 35 min or extended to 80 
minutes for 20-fold diluted samples for measurement of 
furfural (Injector Model L-2200). Fluted compounds were 
registered and quantified by a refractive index detector 
(Model L-2490) interfaced to a computer. Soluble fermen- 
tation products were identified by comparison with reten- 
tion times and peak areas of corresponding standards. 
Metabolites were separated on an Aminex HPX-87H, 
300 X 7.8 mm column (Bio-Rad, CA). Percentage carbon 
recoveries were calculated based on cellobiose con- 
sumed and the amount of ethanol, acetate and lactate 
produced [49]. 

RNA extraction and ds-cDNA synthesis 

Cells were harvested in two 50 ml aliquot samples taken 
from controlled batch fermentations by centrifugation. 



snap frozen and stored at -80°C. Cell pellets from one 
tube were resuspended in TRIzol reagent (Invitrogen, 
CA), disrupted and RNA extracted as described previously 
[25]. Cell lysis involved bead beating with 0.25 g of lysis 
beads from an UltraClean Microbial RNA Kit (MO BIO 
Laboratories, Inc, CA) at 6,500 rpm for three 20 s treat- 
ments in a Precellys 24 high-throughput tissue homoge- 
nizer (Bertin Technologies, Montigny-le-Bretonneux, 
France). Cell lysates were transferred to fresh tubes and 
purified as described previously [25,48] . Total RNA prepa- 
rations were DNasel treated (Ambion, TX) and purified 
using a RNeasy mini kit (Qiagen, CA). RNA quantity was 
determined by NanoDrop ND-1000 spectrophotometer 
(NanoDrop Technologies, DE) and RNA quality was 
assessed with Agilent Bioanalyzer (Agilent, CA). Purified 
RNA of high quality was used as the template to generate 
ds-cDNA using Invitrogen ds-cDNA synthesis kit accor- 
ding to the manufacturers protocols (Invitrogen, CA). 

Microarray sample labeling, hybridization, scan, and 
statistical analysis of array data 

The ds-cDNA was labeled, hybridized and washed 
according to the NimbleGen protocols as described previ- 
ously [25]. Hybridizations were conducted using a 12-bay 
hybridization station (BioMicro Systems, Inc., UT) and 
the arrays dried using a Maui wash system (BioMicro Sys- 
tems, Inc.). Microarrays were scanned with a Surescan 
high- resolution DNA microarray scanner (5 (im) (Agilent 
Technologies, CA), and the images were quantified using 
NimbleScan software (Roche NimbleGen, IN). Raw data 
was log2 transformed and imported into the statistical 
analysis software JMP Genomics 6.0 software (SAS Insti- 
tute, NC). The data of the three stress studies and the 
controls were normalized together using a single round of 
the LOESS normalization algorithm within JMP Genom- 
ics and distribution analyses were conducted before and 
after normalization as a quality control step. An ANOVA 
was performed in JMP Genomics to determine differential 
expression levels between conditions and time points 
using the False Discovery Rate (FDR) testing method 
{p < 0.05). One way Hierarchical clustering of the ratios 
(Control vs. Stress) of significantly differentially expressed 
genes were performed in JMP Genomics using the default 
settings of ten clusters. 

Real-Time quantitative-PCR (RT-qPCR) analysis 

Microarray data were validated using real-time qPCR, as 
described previously [25]. Genes representing a range of 
gene expression values based on microarray hybridiza- 
tions were analyzed using qPCR from cDNA derived 
from different time point samples. Oligonucleotide se- 
quences of the primers targeting the genes selected for 
qPCR analysis are fisted in Additional file 4: Table S4. 
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Regulatory motif analysis 

Genes that were considered significantly differentially 
expressed by microarray were analyzed by BoBro and a 
motif analysis toolkit to identif)^ potential cis regulatory 
motifs [50,51]. These putative motifs were evaluated 
by a phylogenetic footprinting framework utilizing 39 
Clostridium genome sequences available in public data- 
bases. This identified 120 putative motifs that were well 
conserved across the strains and were thus more likely 
to be actual cis regulatory elements. Furthermore, using 
the clustering algorithm MCL [52], the 120 motifs were 
clustered into 54 groups based on the similarity score 
between any pair of motifs (use 0.45 as the similarity 
score cutoff). For each one out of the 54 motif patterns, 
the co-expression property of downstream operons was 
assessed by a popular biclustering algorithm, QUBIC 
[48]. The motif consensus sequences were mapped to 
known prokaryotic cis regulatory elements databases 
[53] to determine whether the predicted motif patterns 
were real cis regulatory elements in other prokaryotic 
genomes. 



Additional files 



particular motif; Target cor^sensus, sequer^ce identified by proposed 
regulator in Regulator Name column. 

Additional file 4: Table 56. Primer sequences used for Real Time qPCR 
validation of the microarray results. 
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